A Lattice-based Graph Index for Subgraph Search

نویسندگان

  • Dayu Yuan
  • Prasenjit Mitra
چکیده

Given a query graph q, a “subgraph-search” algorithm retrieves from a graph database D all graphs that have q as a subgraph, D(q). Subgraph search is costly because of its involvement of a subgraph-isomorphism test, which is a NPcomplete problem. Graph indexes are used to improve the algorithm efficiency by first filtering out a set of false answers and then verifying each graph that has passed the filtration with subgraph isomorphism tests. Many substructure features have been proposed to build the index aiming at improving the filtering power of the index. In this paper we improve the filtering power and query processing time by design of the index structure. We propose a lattice like index, Lindex, which is generally applicable on all graph features. Lindex achieves a high filtering rate by organizing index subgraphs in a graph lattice and adopting a specific design of value sets. Besides finding the candidate set C(q) after filtering, Lindex can also find a set of true answers Tr(q) without involving subgraph isomorphism tests. Accordingly, only candidate graphs in C(q) − Tr(q) need to be verified. Our experiments show that Lindex outperforms other cutting-edge indexes on both frequent subgraph and infrequent subgraph queries.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Labeling Subgraph Embeddings and Cordiality of Graphs

Let $G$ be a graph with vertex set $V(G)$ and edge set $E(G)$, a vertex labeling $f : V(G)rightarrow mathbb{Z}_2$ induces an edge labeling $ f^{+} : E(G)rightarrow mathbb{Z}_2$ defined by $f^{+}(xy) = f(x) + f(y)$, for each edge $ xyin E(G)$.  For each $i in mathbb{Z}_2$, let $ v_{f}(i)=|{u in V(G) : f(u) = i}|$ and $e_{f^+}(i)=|{xyin E(G) : f^{+}(xy) = i}|$. A vertex labeling $f$ of a graph $G...

متن کامل

Mining and Indexing Graphs for Supergraph Search

We study supergraph search (SPS), that is, given a query graph q and a graph database G that contains a collection of graphs , return graphs that have q as a supergraph from G. SPS has broad applications in bioinformatics, cheminformatics and other scientific and commercial fields. Determining whether a graph is a subgraph (or supergraph) of another is an NP-complete problem. Hence, it is intra...

متن کامل

Parallelizing Subgraph Isomorphism Reenement for Classiication and Retrieval of Conceptual Structures 3 a Relation-based Representation For

abstract Major applications of graph-based knowledge representations will require quick response times on extremely large knowledge bases. Although algorithmic developments have provided tremendous improvements in speed, we believe implementation on parallel processors will be needed to meet long-term needs. This paper presents a new parallelization of a subgraph isomorphism reenement algorithm...

متن کامل

The principal ideal subgraph of the annihilating-ideal graph of commutative rings

Let $R$ be a commutative ring with identity and $mathbb{A}(R)$ be the set   of ideals of $R$ with non-zero annihilators. In this paper, we first introduce and investigate the principal ideal subgraph of the annihilating-ideal graph of $R$, denoted by $mathbb{AG}_P(R)$. It is a (undirected) graph with vertices $mathbb{A}_P(R)=mathbb{A}(R)cap mathbb{P}(R)setminus {(0)}$, where   $mathbb{P}(R)$ is...

متن کامل

Efficient Subgraph Similarity Search on Large Probabilistic Graph Databases

Many studies have been conducted on seeking the efficient solution for subgraph similarity search over certain (deterministic) graphs due to its wide application in many fields, including bioinformatics, social network analysis, and Resource Description Framework (RDF) data management. All these works assume that the underlying data are certain. However, in reality, graphs are often noisy and u...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011